t MTERED 



RAW SEQUENCE LISTING DATE: 06/2 8/2002 

PATENT APPLICATION: US/09/9 81 , 286A TIME: 14:15:58 

Input Set : A:\26500260101.ST25.txt 
Output Set: N:\CRF3\06282002\I981286A.raw 

3 <110> APPLICANT: Watowich, Stanley J. 

4 Weaver, Scott C. 

5 Davey, Robert A. 

7 <120> TITLE OF INVENTION: Drug Discovery Methods 
9 <130> FILE REFERENCE: 265.00260101 

11 <140> CURRENT APPLICATION NUMBER: US 09/981, 286A 

12 <141> CURRENT FILING DATE: 2001-10-15 

14 <150> PRIOR APPLICATION NUMBER: US 60/240,187 

15 <151> PRIOR FILING DATE: 2000-10-13 
17 <160> NUMBER OF SEQ ID NOS : 36 

19 <170> SOFTWARE: Patentln version 3.0 

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 157 

23 <212> TYPE: PRT 

24 <213> ORGANISM: VENEZUELAN EQUINE ENCEPHALITIS VIRUS 
26 <400> SEQUENCE: 1 

2 8 Val Met Lys Leu Glu Ser 
29 1 5 

31 Lys lie Asn Gly Tyr Ala 

32 20 

34 Met His Val Glu Gly Lys 

35 35 

3 7 Thr Lys Lys Ala Ser Lys 
38 50 

4 0 Asn Met Arg Ala Asp Thr 
41 65 70 
4 3 Tyr Tyr Ser Trp His His 
44 85 
4 6 Thr Val Pro Lys Gly Val 
47 100 

4 9 Leu Asp Asn Gin Gly Arg 
50 115 

5 2 Glu Gly Ser Arg Thr Ala 
53 130 

55 Val Thr Val Lys Tyr Thr 

56 145 150 

58 <210> SEQ ID NO: 2 

59 <211> LENGTH: 11 

60 <212> TYPE: PRT 
C--> 61 <213> ORGANISM: ARTIFICIAL 

6 3 <220> FEATURE: 

64 <22 3> OTHER INFORMATION: Cell- perinea nt polypeptide 
6 6 <400> SEQUENCE: 2 




Asp 


Lys 


Thr 


Phe 
10 


Pro 


He 


Met 


Leu 


Glu 
15 


Gly 


Cys 


Val 


Val 
25 


Gly 


Gly 


Lys 


Leu 


Phe 
30 


Arg 


Pro 


He 


Asp 
40 


Asn 


Asp 


Val 


Leu 


Ala 
45 


Ala 


Leu 


Lys 


Tyr 


Asp 


Leu 


Glu 


Tyr 


Ala 


Asp 


Val 


Pro 


Gin 


55 










60 










Phe 


Lys 


Tyr 


Thr 


His 
75 


Glu 


Lys 


Pro 


Gin 


Gly 
80 


Gly 


Ala 


Val 


Gin 
90 


Tyr 


Glu 


Asn 


Gly 


Arg 
95 


Phe 


Gly Ala 


Lys 


Gly 


Asp 


Ser 


Gly 


Arg 


Pro 


He 






105 










110 






Val 


Val 


Ala 


He 


Val 


Leu 


Gly 


Gly Val 


Asn 




120 










125 








Leu 


Ser 


Val 


Val 


Met 


Trp 


Asn 


Glu 


Lys 


Gly 


135 










140 










Pro 


Glu 


Asn 


Cys 


Glu 
155 


Gin 


Trp 









/ 
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68 
69 



Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
15 10 



71 <210> SEQ ID NO: 3 

72 <211> LENGTH: 16 

73 <212> TYPE: PRT 

C--> 74 <213> ORGANISM: ARTIFICIAL 

76 <220> FEATURE : 

77 <223> OTHER INFORMATION: Cell-permeant polypeptide 
79 <400> SEQUENCE: 3 

81 Arg Gin lie Lys lie Trp Phe Gin Asn Arg Arg Met Lys Trp Lys Lys 

82 1 5 10 15 

84 <210> SEQ ID NO: 4 

85 <211> LENGTH: 16 

86 <212> TYPE: PRT 

C--> 87 <213> ORGANISM: ARTIFICIAL 

89 <220> FEATURE: 

90 <223> OTHER INFORMATION: Cell-permeant polypeptide 
92 <400> SEQUENCE: 4 

94 Arg Gin lie Lys lie Trp Phe Pro Asn Arg Arg Met Lys Trp Lys Lys 

95 1 5 10 15 

97 <210> SEQ ID NO: 5 

98 <211> LENGTH: 16 

99 <212> TYPE: PRT 

C--> 100 <213> ORGANISM: ARTIFICIAL 

102 <220> FEATURE: 

103 <223> OTHER INFORMATION: Cell-permeant polypeptide 
105 <4 00> SEQUENCE: 5 

107 Arg Gin Pro Lys lie Trp Phe Pro Asn Arg Arg Pro Lys Trp Lys Lys 

108 1 5 10 15 

110 <210> SEQ ID NO: 6 

111 <211> LENGTH: 525 

112 <212> TYPE: DNA 

C--> 113 <213> ORGANISM: ARTIFICIAL 

115 <220> FEATURE: 

116 <223> OTHER INFORMATION: Nucleotide sequence encoding tat-CCD 

118 <400> SEQUENCE: 6 

119 atgtacggtc gtaaaaaacg tcgtcagcgt cgtcgtgtca tgaaattgga atctgacaag 
121 acgttcccaa tcatgttgga agggaagata aacggctacg cttgtgtggt cggagggaag 
123 ttattcaggc cgatgcatgt ggaaggcaag atcgacaacg acgttctggc cgcgcttaag 
125 acgaagaaag catccaaata cgatcttgag tatgcagatg tgccacagaa catgcgggcc 
127 gatacattca aatacaccca tgagaaaccc caaggctatt acagctggca tcatggagca 
129 gtccaatatg aaaatgggcg tttcacggtg ccgaaaggag ttggggccaa gggagacagc 
131 ggacgaccca ttctggataa ccagggacgg gtggtcgcta ttgtgctggg aggtgtgaat 
13 3 gaaggatcta ggacagccct ttcagtcgtc atgtggaaca agcttggatc ttctctcgag 
135 ggagttaccg tgaagtatac tccggagaac tgcgagcaat ggtaa 

138 <210> SEQ ID NO: 7 

139 <211> LENGTH: 169 

140 <212> TYPE: PRT 

C--> 141 <213> ORGANISM: ARTIFICIAL 



60 
120 
180 
240 
300 
360 
420 
480 
525 
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143 


<220> FEATURE: 


























144 


<223> OTHER 


INFORMATION 


: Amino acid 


sequence of 


tat' 


- CCD 






146 


<400> SEQUENCE: 


■"7 
























148 


Met 


Tyr 


Gly 


Arg 


Lys 


Lys 


Arg 


Arg 


bin 


Arg 


Arg 


Arg 


Val 


Met 


Lys 


Leu 


149 


i 








c 
D 










1 A 

lu 










15 




151 


Glu 


Ser 


Asp 


Lys 


Thr 


Phe 


Pro 


Tin 

i le 


Mei. 


Leu 


C 1 n 
Lj-LU 




Lys 


lie 


Asn 


Gly 


152 








zi) 










o c 

ZD 










30 






154 


Tyr 


Ala 


Cys 


Val 


Val 


Gly Gly. 


Lys 


Leu 


Phe 


Arg 


Pro 


Met 


His 


Val 


Glu 


155 






o c 
3 D 










A A 
4 U 










45 








157 


Gly 


Lys 


lie 


Asp 


Asn 


Asp 


Val 


Leu 


Aia 


Ala 


Leu 


Lys 


Thr 


Lys 


Lys 


Ala 


158 




50 










55 










oU 










160 


Ser 


Lys 


Tyr 


Asp 


Leu 


Glu 


Tyr 


Ala 


Asp 


Val 


Pro 


Gin 


Asn 


Met 


Arg 


Ala 


161 


65 










70 










/ D 










80 


163 


Asp 


Thr 


Phe 


Lys 


Tyr 


Thr 


His 


Glu 


Lys 


Pro 


Gin 


Gly 


Tyr 


Tyr 


Ser 


Trp 


164 










85 










O A 










95 




166 


His 


His 


Gly 


Ala 


Val 


Gin 


Tyr 


pi,, 

GlU 


Asn 


biy 


Arg 


Fne 


Thr 


Val 


Pro 


Lys 


167 








100 










1 A C 

10 D 










110 






169 


Gly 


Val 


Gly 


Ala 


Lys 


Gly Asp 


Ser 


Gly 


Arg 


Pro 


lie 


Leu 


Asp 


Asn 


Gin 


170 






115 










"1 T A 
120 










125 








172 


Gly 


Arg 


Val 


Val 


Ala 


He 


Val 


Leu 


Gly 


Gly 


Val 


Asn 


Glu 


Gly 


Ser 


Arg 


173 




130 










135 




















175 


Thr 


Ala 


Leu 


Ser 


Val 


Val 


Met 


Trp 


Asn 


Glu 


Lys 


Gly 


Val 


Thr 


Val 


Lys 


176 


14 5 










150 










-ICC 

lJJ 










160 


178 


Tyr 


Thr 


Pro 


Glu 


Asn 


Cys 


Glu 


Gin 


Trp 
















179 










165 
























181 


<210> SEQ ID NO: 


8 
























182 


<211> LENGTH: 124 
























183 


<212> TYPE: 


PRT 


























184 


<213> ORGANISM: 


BOS 


TAURUS 




















186 


<4 00> SEQUENCE: 


8 
























188 


Lys 


Glu 


Thr 


Ala 


Ala 


Ala 


Lys 


Pne 


Glu 


Arg 


Gin 


His 


Met 


Asp 


Ser 


Ser 


189 


1 








5 










1 A 

10 










15 




191 


Thr 


Ser 


Ala 


Ala 


Ser 


Ser 


Ser 


Asn 


Tyr 


Cys 


Asn 


Gin 


Met 


Met 


Lys 


Ser 


192 








20 










n c 
ZD 










30 






194 


Arg 


Asn 


Leu 


Thr 


Lys 


Asp 


Arg 


Cys 


Lys 


Pro 


Val 


Asn 


Thr 


Phe 


Val 


His 


195 






35 










40 










45 








197 


Glu 


Ser 


Leu 


Ala 


Asp 


Val 


Gin 


Ala 


Val 


Cys 


Ser 


Gin 


Lys 


Asn 


Val 


Ala 


198 




50 










55 










60 










200 


Cys 


Lys 


Asn 


Gly 


Gin 


Thr 


Asn 


Cys 


Tyr 


Gin 


Ser 


Tyr 


Ser 


Thr 


Met 


Ser 


201 


65 










70 










75 










80 


203 


lie 


Thr 


Asp 


Cys 


Arg 


Glu 


Thr 


Gly 


Ser 


Ser 


Lys 


Tyr 


Pro 


Asn 


Cys 


Ala 


204 










85 










90 










95 




206 


Tyr 


Lys 


Thr 


Thr 


Gin 


Ala 


Asn 


Lys 


His 


He 


He 


Val 


Ala 


Cys 


Glu 


Gly 


207 








100 










105 










110 






209 


Asn 


Pro 


Tyr 


Val 


Pro 


Val 


His 


Phe 


Ala 


Ala 


Ser 


Val 










210 






115 










120 


















212 


<210> SEQ ID NO: 


9 
























213 


<211> LENGTH: 37 
























214 


<212> TYPE: 


DNA 
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C--> 215 <213> ORGANISM: ARTIFICIAL 

217 <220> FEATURE: 

218 <223> OTHER INFORMATION: Primer 

220 <400> SEQUENCE: 9 

221 gggaattcca tatggtcatg aattggaatc tgacaag 3 7 

224 <210> SEQ ID NO: 10 

225 <211> LENGTH: 42 

226 <212> TYPE: DNA 

C--> 227 <213> ORGANISM: ARTIFICIAL 

229 <220> FEATURE: 

230 <223> OTHER INFORMATION: Primer 
232 <400> SEQUENCE: 10 

23 3 gaattcggat cctcattacc attgctcgca gttctccgga gt 42 

236 <210> SEQ ID NO: 11 

237 <211> LENGTH: 6 

238 <212> TYPE: PRT 

C--> 239 <213> ORGANISM: ARTIFICIAL 

241 <220> FEATURE: 

242 <223> OTHER INFORMATION: A variable region amino acid sequence 

244 <22 0> FEATURE: 

245 <221> NAME/KEY: Variant 

246 <222> LOCATION: (1)..(6) 

247 <223> OTHER INFORMATION: Any amino acid 
250 <400> SEQUENCE: 11 

W--> 252 Xaa Xaa Xaa Xaa Xaa Xaa 
253 1 , 5 

255 <210> SEQ ID NO: 12 

256 <211> LENGTH: 477 

257 <212> TYPE: DNA 

258 <213> ORGANISM: VENEZUELAN EQUINE ENCEPHALITIS VIRUS 

260 <400> SEQUENCE: 12 

261 gtcatgaaat tggaatctga caagacgttc ccaatcatgt tggaagggaa gataaacggc 60 
263 tacgcttgtg tggtcggagg gaagttattc aggccgatgc atgtggaagg caagatcgac 120 
265 aacgacgttc tggccgcgct taagacgaag aaagcatcca aatacgatct tgagtatgca 180 
267 gatgtgccac agaacatgcg ggccgataca ttcaaataca cccatgagaa accccaaggc 240 
269 tattacagct ggcatcatgg agcagtccaa tatgaaaatg ggcgtttcac ggtgccgaaa 300 
271 ggagttgggg ccaagggaga cagcggacga cccattctgg ataaccaggg acgggtggtc * 360 
273 gctattgtgc tgggaggtgt gaatgaagga tctaggacag ccctttcagt cgtcatgtgg 420 
275 aacgagaagg gagttaccgt gaagtatact ccggagaact gcgagcaatg gtaatga 477 

278 <210> SEQ ID NO: 13 

279 <211> LENGTH: 43 

280 <212> TYPE: DNA 

C--> 281 <213> ORGANISM: ARTIFICIAL 

283 <220> FEATURE: 

284 <223> OTHER INFORMATION: Primer 

286 <400> SEQUENCE: 13 

287 agctaggaat tcggatccca tatgtacggt cgtaaaaaac gtc 43 

290 <210> SEQ ID NO: 14 

291 <211> LENGTH: 33 
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292 <212> TYPE: DNA 
C--> 293 <213> ORGANISM: ARTIFICIAL 

295 <220> FEATURE: 

296 <223> OTHER INFORMATION: Primer 

298 <400> SEQUENCE: 14 

299 ctagctaagc ttgttccaca tgacgactga aag 33 

302 <210> SEQ ID NO: 15 

303 <211> LENGTH: 36 

304 <212> TYPE: DNA 

C--> 305 <213> ORGANISM: ARTIFICIAL 

307 <220> FEATURE: 

308 <223> OTHER INFORMATION: Primer 

310 <400> SEQUENCE: 15 

311 ctagctgcgg ccgctcatta ccattgctcg cagttc 36 

314 <210> SEQ ID NO: 16 

315 <211> LENGTH: 47 

316 <212> TYPE: DNA 

C--> 317 <213> ORGANISM: ARTIFICIAL 

319 <220> FEATURE: 

320 <223> OTHER INFORMATION: Primer 

322 <400> SEQUENCE: 16 

323 agctagaagc ttggatcttc tctcgaggga gttaccgtga agtatac 47 

326 <210> SEQ ID NO: 17 

327 <211> LENGTH: 50 

328 <212> TYPE: DNA 

C--> 329 <213> ORGANISM: ARTIFICIAL 

331 <220> FEATURE: 

332 <223> OTHER INFORMATION: Primer 

334 <400> SEQUENCE: 17 

335 gatcctcgag agaagatccg gatccgttcc acatgacgac tgaaagggct 50 

338 <210> SEQ ID NO: 18 

339 <211> LENGTH: 51 

340 <212> TYPE: DNA 

C--> 341 <213> ORGANISM: ARTIFICIAL 
34 3 <220> FEATURE: 

344 <223> OTHER INFORMATION: Primer 

346 <400> SEQUENCE: 18 

347 gatcgaattc caccagcaga atcgacatat gtacggtcgt aaaaaacgtc g 51 

350 <210> SEQ ID NO: 19 

351 <211> LENGTH: 27 

352 <212> TYPE: DNA 

C--> 353 <213> ORGANISM: ARTIFICIAL 

355 <220> FEATURE: 

356 <223> OTHER INFORMATION: Primer 

358 <220> FEATURE: 

359 <221> NAME/KEY: misc_f eature 

360 <222> LOCATION: (15).. (16) 

361 <223> OTHER INFORMATION: A, T, G, or C 
364 <400> SEQUENCE: 19 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <2 20> 
to <223> fields of each sequence which presents at least one n or Xaa, 

Seq#:ll; Xaa Pos . 1,2,3,4,5,6 

Seq#:19; N Pos. 15,16 

Seq#:23; N Pos. 15,16 

Seq#:25; N Pos. 16,17 

Seq#:26; N Pos. 20,21 

Seq#:28; N Pos . 19,20,22,23,25,26,28,29,31,32,34,35 

Seq#:30; N Pos. 1,2,4,5,7,8,10,11,13,14,16,17 

Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#: 2, 3, 4, 5, 6, 7, 9, 10, 11, 13 , 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 2 5, 26, 27, 28, 29, 30 
Seq#:31,32,33,34,35,36 
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L:61 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 2 
L:74 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 3 



T 

JLj 


87 M:220 C: 


Keyword misspelled or invalid format/ 


<213> ORGANISM for SEQ ID# : 


A 


t 

J_i 


-L u u 


JYi 


• 000 
. ZZ\J 


p • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


■f r\T 


D£iy 


lUtt 


* S 


T 
i_l 




jyi 


. 22 U 


p • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


_L KJl. 


OL^ 


Tn# 
J. U ft 


. U 


T 


1 A 1 
141 


jyi 


• o o o 


p ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


f rv"T 
J. ul 




J. U IT 


. 7 


T 
J_l 


2 ±D 


M 


• o o o 

. ZZU 


p ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


-L \J±. 




ijjtr 


* Q 


T 
J_l 


22 1 


M 


: zzU 


P ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


f or 




rnit 


. ±u 


1j 




rjt 

M. 


, oon 
: 22 u 


P ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




J. Jjff 




T 
J_l 


2D 2 


Ul 
JM 


• 74 1 

. J*il 


Ta7 ■ 


(46) "n" or "Xaa" used, for SEQ ID# : 11 after pos. :0 










T 
i_l 


zol 


"M 

M 


: 22 u 


r* . 

L. : 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


r or 




x 'n ±t 
j-ijff 




T 

JLl 


0 Q "3 


1UI 

M 


, zzU 


p • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




J. U TT 


• 1 A 


T 

1j 


*3 C\ 

JV J 


\A 

M 


: z/U 


L- . 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




t n it 




T 

Li 


J X 1 


\A 

M 


: zz u 


P ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


i or 








T 

Li 


1 0 Q 


M 


: 2 2 u 


P • 

u : 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


i or 




luff 




T 
ll 


O 4 J. 


M 


. o o n 
: zzU 


c : 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


ior 




T Pi it 
.LUff 


* 1 ft 


L 




\A 

M 


o o n 
zz U 


. 

C . 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




± Uit 


. 1 Q 


T 
Li 


T £ C 
jDJ 


us 


"5/11 
J41 


Ta7 • 

w . 


(46) "n" or "Xaa" used, for SEQ ID#:19 after pos.:0 










T 

Li 


J /J. 


M 


oon 
zz U 


L. : 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


x or 




XlJff 


. z u 


L 




M 


zz U 


P • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




XlJff 


. 2 ± 


T 

Li 


one: 

,5 y p 


Vf 


z z u 


P ■ 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 




J. L/tt 


• 0 D 
. 2 2 


T 
Li 


/I fl "7 


M 


22 U 


p • 

o : 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 


QT?P 


TPiit 


. Z J 


t 
Jj 


y| 1 Q 

4 _l y 


101 

jyi 


"3/11 


W . 


(46) "n" or "Xaa" used, for SEQ ID# : : 


23 after pos. :0 










T 

Li 


4 /! D 


JYI 


ZzU 


P • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


r or 


c x?r\ 


± Uff 


■ OA 
. Z ft 


T 

Lt 




1X1 


ZZU 


P • 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


J- UI 


GDA 


J. LJ TT 


. Z D 


T . 

J-J . 


449 


M 


341 


W: 


(46) "n" or "Xaa" used, for SEQ ID# : : 


25 after pos. :0 










L: 


455 


M 


220 


C: 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 


SEQ 


ID# 


:26 


L: 


467 


M 


341 


W: 


(46) "n" or "Xaa" used, for SEQ ID# : : 


26 after pos. :0 










L: 


473 


M 


220 


C: 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 


SEQ 


ID# 


: 27 


L: 


485 


M 


220 


C: 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 


SEQ 


ID# 


:28 


L: 


527 


M 


341 


W: 


(46) "n" or "Xaa" used, for SEQ ID# : : 


28 after pos. :0 










L: 


533 


M 


220 


C: 


Keyword misspelled 


or 


invalid 


format 


, <213> 


ORGANISM 


for 


SEQ 
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